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This paper presents the first direct estimate of the 3D clustering properties of far- 
infrared sources up to z ~ 3. This has been possible thanks to the Pacs Evolutionary 
Probe (PEP) survey of the GOODS South field performed with the PACS instrument 
onboard the Herschel Satellite. 550 and 502 sources were detected respectively in the 
^ ■ 100/im and in the 160/im channels down to fluxes <Sioo/im = 2 mJy and Si60/xm = 5 

' mJy cuts which ensure >80% completeness of the two catalogues. More than 65% 

of these sources has an (either photometric or spectroscopic) redshift determination 
from the MUSIC catalogue, this percentage rising to ~ 95% in the inner portion of 
GOODS South which is covered by data at other wavelengths. An analysis of the de- 
. projected two-point correlation function w(0) over the whole redshift range spanned 

by the data reports for the (comoving) correlation length, ro ~ 6.3 Mpc and ro ~ 6.7 
Mpc, respectively at 100/im and 160/im, corresponding to dark matter halo masses 
M > 10 124 Mq, in excellent agreement with previous estimates obtained for mid-IR 
selected sources in the same field. Objects at z ~ 2 instead seem to be more strongly 
clustered, with r ~ 19 Mpc and r ~ 17 Mpc in the two considered PACS channels. 
This dramatic increase of the correlation length between z ~ 1 and z ~ 2 is connected 
£2 | with the presence, more visible at 100/im than in the other band, of a wide (at least 4 

Mpc across in projection), M > 1O 14 M , filamentary structure which includes more 
than 50% of the sources detected at z ~ 2. An investigation of the properties of such 
sources indicates the possibility for boosted star-forming activity in those which reside 
within the overdense environment with respect of more isolated galaxies found in the 
same redshift range. If confirmed by larger datasets, this result can be explained as 
due to the combined effect of large reservoirs of gas available at high redshifts in 
deep potential wells such as those associated to large overdensities and the enhanced 
rate of encounters between sources favoured by their relative proximity. Lastly, we 
also present our results on the evolution of the relationship between luminous and 
dark matter in star- forming galaxies between z ~ 1 and z ~ 2. We find that the 
increase of (average) stellar mass in galaxies < M* > between z ~ 1 and z ~ 2 
is about a factor 10 lower than that of the dark matter haloes hosting such objects 
(< M* > 2 ~ 1 / < M* > z ~ 2 ~ 4 • lO^ 1 vs A/^ 7 M haJo ~ 4 • 10" 2 ). When compared 
with recent results taken from the literature, our findings agree with the evolutionary 
picture of downsizing wherby massive galaxies at z ~ 2 were more actively forming 
stars than their z ~ 1 counterparts, while at the same time contained a lower fraction 
of their mass in the form of luminous matter. 

Key words: galaxies: evolution - galaxies: statistics - infrared - cosmology: observa- 
tions - cosmology: theory - large-scale structure of the Universe 
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1 INTRODUCTION 

Investigations of the Large Scale Structure (LSS) traced by 
selected classes of extra-galactic sources, read in connection 
with more and more refined models for its characterization, 
has enabled in the recent years to derive important informa- 
tion on some physical properties characterizing the objects 
producing the clustering signal. One of these key quanti- 
ties is the halo mass and its evolution with look back time, 
which for instance allows to derive cosmological connections 
between different populations of extra-galactic sources ob- 
served in different wavelengths at different epochs. 

The development of these theoretical models for the de- 
scription of the LSS, together with the impressive achieve- 
ments reached in the past fifteen years or so by science 
instrumentation have allowed the scientific community to 
greatly broaden their knowledge on the formation and evo- 
lution of the cosmic structures, pushing it up to the dawn 
of galaxy formation. 

Within this framework, amongst the more interesting 
classes of sources are those which are undergoing some kind 
of active process such as AGN activity or star formation. 
Since the first high redshift observations of the two Hubble 
Deep Fields (Williams et al. 1996; Casertano et al. 2000), 
all clustering analyses performed on optical/near-IR sur- 
veys which were at the same time wide enough to ensure 
statistical significance and deep enough to probe high red- 
shift regimes, have highlighted a strong evolution of these 
two classes of sources, whereby activity shifts to smaller 
and smaller haloes/galaxies as one moves forward in time. 
The critical redshift for this transition seems to be around 
z ~ 2 for star-forming galaxies (e.g. Magliocchetti & Mad- 
dox 1999; Arnouts et al. 2002; Foucaud et al. 2010; Hartley 
et al. 2010) and z ~ 3 for QSOs (e.g. Porciani, Magliocchetti 
& Norberg 2004; Shen et al. 2007; Ross et al. 2009). Perhaps 
not surprisingly, these two redshift values bracket the peak 
of cosmic activity both in terms of AGN and star formation 
(Madau et al. 1996). 

Although clustering results in flux-limited surveys are 
somehow masked by the well known 'Malmquist bias' issue 
(more distant and therefore more luminous sources are more 
strongly clustered), some steps forward have been taken in 
order to minimize its effect on high redshift studies (e.g. 
Magliocchetti et al. 2008; Foucaud et al. 2010). These works 
all converge at indicating that massive galaxies form at high 
redshifts and on short timescales, while the sites of activity 
shift to lower mass systems at later epochs. This pattern, 
referred to as "downsizing" (Cowie et al. 1996; Heavens et 
al. 2004), is also observed in an increasing number of inde- 
pendent works (see e.g. Fontana et al. 2004; Treu et al. 2005; 
Cimatti, Daddi & Renzini 2006, Saracco et al. 2006; Bundy 
et al. 2006; McLure et al. 2006 just to mention a few), all 
adopting techniques which differ from clustering analyses. 

The advent of the Spitzer telescope has marked another 
milestone for what concerns our knowledge of the population 
of dusty (and therefore optically dim) active galaxies. Even 
in this case, the downsizing effect was confirmed for all ob- 
scured active sources up to z ~ 2.5 (e.g. Magliocchetti et al. 
2008) . In particular, for the first time Spitzer has detected a 
population of Ultra-Luminous Infrared Galaxies (ULIRGS, 
L > 10 12 L Q ) in the redshift range (1.6 < z < 2.5) with ex- 
tremely high clustering lengths (r ~ 15 Mpc - Farrah et al. 
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2006b; Magliocchetti et al. 2007; 2008; Brodwin et al. 2008). 
Taken at face value, these correlation lengths correspond lo- 
cally to those of groups-to-poor clusters of galaxies (see e.g. 
Guzzo et al. 2000; Estrada et al. 2009), and might be inter- 
preted with a roughly one-to-one correspondence between 
the overwhelming majority of ULIRGS at z ~ 2 and local 
clusters. 

The recent advent of the Herschel telescope allows the 
scientific community to further extend its knowledge on the 
formation and evolution of dusty sources up to very high 
redshifts. In fact, thanks to the fantastic capabilities of the 
PACS instrument, the GOODS South region has been ob- 
served at 70/im, lOOfim and 160/mi to unprecedented depths 
as a part of the PEP (PACS Evolutionary Probe, D. Lutz et 
al. submitted to A&A) survey. The majority of PEP sources 
in the GOODS-S is endowed with a redshift determination 
from the MUSIC catalogue (Santini et al. 2009) and this 
allowed us to estimate, for the first time in a self-consistent 
way, the 3D clustering properties of the population of far- 
infrared galaxies up to z ~ 3. Knowledge of the redshift 
distribution N(z) further allows to investigate the evolution 
of clustering with look-back time. In the light of what was 
discussed above, we have then also investigated the large- 
scale pattern traced by PEP galaxies at redshifts z ~ 2 and 
will discuss our findings in terms of abundances of super- 
structures or proto-clusters made of obscured star forming 
galaxies right at the peak of cosmic star formation activity. 

The work is organized as follows: §2 introduces the PEP 
catalogues and analyses some of the properties of the sources 
with the help of multiwavelength data coming from the MU- 
SIC catalogue. §§3 and 4 present the results for the two-point 
angular correlation function of all PEP sources spanning the 
whole z = [0 — 3] redshift range. §5 concentrates on the large- 
scale properties of far-infrared sources residing in the z ~ 2 
epoch which corresponds to the peak of cosmic star forma- 
tion activity. §6 provides a more detailed description of z ~ 2 
PEP sources, in particular focussing on the properties of an 
extended filamentary structure (or proto-cluster) made of 
100/im-selected galaxies at z ~ 2.2. §7 presents our results 
on the evolution of the relationship between luminous and 
dark matter in PEP-selected galaxies and compares them 
with recent works from the literature, while §8 summarizes 
our findings. 

Throughout this paper we assume a ACDM cosmology 
with H = 70 Km s _1 Mpc"\ fi = 0.27 and fi A = 0.73. 



2 THE DATA 

The GOODS South (GOODS-S) region has been observed 
by the PACS (Poglitsch et al. 2010) instrument onboard 
the Herschel Space Observatory (Pilbratt et al. 2010) as a 
part of the PACS Evolutionary Probe (PEP, D. Lutz et al. 
submitted to A&A) Survey, aimed at studying the properties 
and cosmological evolution of the infrared population up 
to redshifts z ~ 3 — 4. We refer to the Lutz et al. paper 
for further information on the survey and fields, including 
GOODS South, data reduction and catalogue creation. 

The total number of sources detected in the blind (i.e. 
without the use of priors) catalogues at the > 3<r confidence 
level is 375 at 70^im, 717 at lOO^rn and 867 at 160^im. The 
corresponding flux limits in the three wave-bands are ~ 1.2 



mjy, ~ 1.2 mjy and ~ 2.0 mjy. Given the relative paucity of 
sources in the blue/70/im channel which does not allow sen- 
sible clustering estimates, this paper will only concentrate 
on the two higher wavelengths, i.e. the 100/im and 160/im 
bands. 

GOODS-S is one of the best studied cosmological fields, 
having been observed in great depth at virtually all the avail- 
able wavelengths. The GOODS-MUSIC catalogue (Santini 
et al. 2009) provides a compilation of high quality multi- 
wavelength (from 0.3/im to 24/im) data for sources detected 
in the GOODS-S area in the z-, Ks-, and 4.5 /im-bands. 
The largest fraction of the sample is 90% complete at z~26 
or Ks~23.8 or 4.5um~23.5 (AB scale; Oke, 1970). The fi- 
nal catalogue comprises 15,208 sources, out of which nearly 
3,000 are endowed with a spectroscopic redshift determi- 
nation. Most of the remaining sources have assigned pho- 
tometric redshifts, estimated by following the technique of 
Grazian et al. (2006). 

The original ^ 3<r PEP catalogues at the two relevant 
frequencies were then cross-correlated with the MUSIC 
catalogue through a three-band maximum likelihood proce- 
dure (Sutherland & Saunders, 1992; see Berta et al. 2010 
for further details). This found respectively 475 (out of 717) 
and 536 (out of 867) IR-selected galaxies with a counterpart 
in MUSIC. The overwhelming majority of these objects has 
an assigned (either photometric or spectroscopic) redshift. 
In a handful of cases this was not possible due to the 
lack of mainly JHK band data at the edges of the area 
covered by MUSIC. We have also checked the accuracy 
of the photometric redshift estimates for our population 
of star-forming galaxies. The mean spread is found to be 
((zspec - %hot)/(l + «s P ec)) 0.076, only slightly larger 
than the value found by Santini et al. (2009) for the whole 
GOODS-MUSIC catalogue. By investigating this issue 
further, we found that such an increased value is mainly 
due to a handful (about 6) of outliers with either 2: spcc or 
Zphot beyond ~ 2. Indeed, if one removes these few sources, 
the mean spread in photometric accuracy is about 0.049, 
i.e. even smaller than that reported by Santini et al. (2009). 

Clustering studies need high completeness levels. This is why 
in the following analysis we will only concentrate on those 
sources belonging to the blind PEP catalogues which have 
PACS fluxes above the ~80% completeness levels. These lim- 
its, as estimated from simulations (cfr. Lutz et al. submitted 
to A&A) are 2 mjy at 100/^m and 5 mjy at 160/xm. Apply- 
ing the above flux cuts, the number of sources respectively 
in the green and red bands of the blind catalogues are 550 
and 502. These are the sources which will be used for the 
clustering studies. Their projected distribution onto the sky 
is shown in Figure [1] by the empty circles. 

Of these sources, respectively 343 at 100/im and 296 
at 160/im have a counterpart in the MUSIC catalogue, ob- 
tained as previously explained. Their distribution onto the 
sky is represented in Figure [T] by the crosses. From the two 
plots it is possible to appreciate that almost all PEP sources 
in the inner GOODS-S area have a counterpart in the MU- 
SIC catalogue, the apparent discrepancy between the total 
number of PEP-detected objects and of those which also 
have a counterpart in other wavebands simply due to the 
fact that the PEP survey covers a slightly wider area than 
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Figure 1. Projected distribution of sources in the GOODS-S down to ~ 80% completeness limit at lOO^tm (left-hand panel) and 160/^m 
(right-hand panel). Crosses indicate those sources which have a counterpart in the MUSIC catalogue (Santini et al. 2009), while the 
squares highlight objects which, although present in the MUSIC catalogue, lack of a redshift determination (either photometric or 
spectroscopic - see text for details). 
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Figure 2. Fraction of sources with either a spectroscopic or a photometric redshift determination in GOODS-S as a function of far-IR 
flux. The solid line represents PEP lOOfim observations, while the dashed one is derived at 160/xm. The left-hand panel refers to the 
whole area surveyed by PEP, while the right-hand one only concentrates on those PEP sources falling in the inner region of GOODS 
South which is also covered by MUSIC data. 



those at other frequencies. 

The number of PEP-MUSIC sources above the 80% com- 
pleteness level with an assigned redshift is 315 and 283, re- 
spectively at 100/im and 160/im. Those sources which do 
have a counterpart in MUSIC but for which it was not pos- 
sible to assign a redshift estimate are indicated in Figure [1] 



by the empty squares. As already discussed, they mostly lie 
at the edges of the area covered by MUSIC data which lack 
of JHK band data coverage. 

The ratio between the total number of PEP sources and 
of those with a redshift determination as a function of PACS 
fluxes is presented in Figure [2] by the histograms, solid line 
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We chose to use the estimator (Hamilton 1993) 
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Figure 3. Rcdshift distribution for the > 80% complete samples 
of PACS-selected sources in the GOODS-S. The solid line refers 
to the 100/xm sample, while the dashed line is at 160^tm. 



for the 100/im case, dashed one at 160/^m. The left-hand 
panel refers to all the sources detected by the PEP survey, 
while the right-hand one only considers those PEP-selected 
objects which fall within the area covered by MUSIC data. 
As it is possible to notice, the success rate for redshift de- 
termination is roughly constant with far-IR flux, at least for 
fluxes above the 80% completeness level (represented in the 
right-hand panel by the dotted vertical lines). Furthermore, 
the right-hand panel also indicates that in the common re- 
gion covered by both MUSIC and PEP observations, more 
than 90% of the PEP sources above the chosen flux limits 
are endowed with a redshift determination. We will discuss 
the implications of these results in §3.2 when dealing with 
conversions to spatial estimates of the angular correlation 
function. 

The redshift distribution for those PEP-selected sources 
with a counterpart in the MUSIC catalogue and with PACS 
fluxes above the 80% completeness level is presented in Fig- 
ure |31 The distributions at the two frequencies are virtually 
identical, with a first peak around z ~ 0.7, a secondary peak 
at z ~ 1.1, then a rapid decrease before presenting a wider 
bump between redshifts 2 and 3. The two primary peaks 
correspond to the two well studied clusters at z = 0.73 and 
z = 1.1 (e.g. Gilli et al. 2003; Vanzella et al. 2005). 



3 CLUSTERING ANALYSIS 

3.1 The Angular Correlation Function 

The angular two-point correlation function w(8) gives the 
excess probability, with respect to a random Poisson distri- 
bution, of finding two sources in the solid angles 5Qi SH2 
separated by an angle 9. In practice, w(0) is obtained by 
comparing the observed source distribution with a catalogue 
of randomly distributed objects subject to the same mask 



w(0) = 4 x 



DD_RR 

{DRY 



1, 



(1) 



where DD, RR and DR are the number of data-data, 
random-random and data-random pairs separated by a dis- 
tance 9. 

Given the > 80% completeness of both the 100/im and 
160/im considered samples, in the specific case of PACS- 
selected sources we have simply generated random cata- 
logues with twenty times as many objects as the real data 
sets, covering the whole surveyed GOODS South area minus 
the outermost regions which showed irregular data coverage. 
Within the same considered area, the number of Sioo^m ^ 2 
mjy and Sieo^m ^ 5 mJy PEP-selected sources is respec- 
tively 519 and 470. w(9) in eq. JT} for the above objects 
was then estimated in the angular range 10~ 3 < 8 < 0.1 
degrees. The upper limit was determined by the geometry 
of the GOODS-S field, and corresponds to about half of the 
maximum scale probed by the survey. 

Figure 4 presents our results for the two lOO^tm-selected 
(4a; left-hand panel) and 160^im-selected (4b; right-hand 
panel) samples. The error bars show Poisson estimates for 
the points. Since the distribution is clustered, these esti- 
mates only provide a lower limit to the uncertainties. How- 
ever, it can be shown that over the considered range of angu- 
lar scales this estimate is close to those obtained from boot- 
strap resampling (e.g. Willumsen, Freudling & Da Costa, 
1997). Due to the angular resolution of the Herschel mir- 
ror at the two considered wavelengths (respectively ~ 8" 
and ~ 13" at 100/xm and 160/xm) which would cause source 
blending on lower angular scales, the two results in the small- 
est 9 bins of the angular correlation function merely rep- 
resent lower limits and therefore have been indicated with 
upward arrows. 

If we then assume the standard power-law form for 
w(9) = A9 1 ', we can estimate the parameters A and 7 
by using a least-squares fit to the data. Given the large er- 
rors on w, we chose to fix 7 to the standard value 7 = 1.8. 
Although somewhat arbitrary, this figure and its assumed 
lack of dependence on redshift is justified by Large Scale 
Structure observations of large enough samples of high red- 
shift sources so to allow for a direct estimate of the slope 
of w(9) at different look back times (e.g. Porciani, Maglioc- 
chetti & Norberg 2004; Le Fevre et al. 2005). The small area 
of the GOODS-S survey introduces a negative bias through 
the integral constraint, J w est dQ = 0. We allow for this by 
fitting to A9 1 " 1 - C, where C = 1.78 A 
The dashed lines in Figures 4a and 4b represent the best 
power-law fits respectively to the 100/xm and 160/im data- 
sets. The associated best-fit values for the amplitude are 
A wov,m = j-j 2 ± q 4j . 1Q -3 for the ioo M m-selected sample, 

and A 60Mm = [1.2 ± 0.5] ■ 10" 3 for the 160pm-selected one, 
i.e. the results at the different wavelengths are virtually iden- 
tical. 

As a further consistency check, we have released the 
constraint on 7 and let it vary between 1.0 and 2.5. Despite 
the expected degeneracy between the best-fit values of A 
and 7 determined by this choice, we find that in the case 
of the 100/im-selected sample slopes shallower than 1.3 are 
ruled out by our data at the > 95% confidence level. 7 = 1.8 
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Figure 4. Angular correlation function for PACS-selected sources above the 80% completeness level in the GOODS-S at 100/xm and 
160^tm. The dashed lines indicate the best power-law fits to the data according to equation J2j, with ro = 6.3 Mpc and ro = 6.7 Mpc, 
respectively at 100/xm and WOfim, while the dotted lines indicate the best-fits to the data according to equation JJJ with M m ; n = 
10 12,4 Mq at both sampled wavelengths. Upward arrows correspond to the lower limits in the estimates of w(9) in the lowest angular 
scale intervals due to instrument resolution (see text for details). 



(and the corresponding value for A quoted above) indeed 
provides the best fit to the observed w(0), even though 
any value in the range 1.3 ^ 7 ^ 2.2 can be accepted 
within la confidence level. The corresponding ranges of 
variability for the amplitude are J 4 10 ° Mm ~ [0.2-19.0] • 10" 3 . 
The 160/xm-selected sample (for which, however, w{9) is 
less constrained due to a worse angular resolution of the 
corresponding PACS channel) instead seems to prefer 
higher values of 7. This is due to the observed w(9) at the 
smallest angular scales probed by our analysis and possibly 
hints at multiple occupancy for the dark matter haloes 
inhabited by such galaxies (cfr §4). The best-fit value is 
found for 7 2.5 (outside the chosen range of variability 
for 7), with a ler-acceptance that goes down to 7 = 1.9. 
The corresponding ranges of variability for the amplitude 
are A 160,lra ~ [« 0.6) - 8] ■ 10~ 4 . 

The estimated amplitudes for these samples (under the 
assumption of 7 = 1.8) are in good agreement with the 
results obtained for Spitzer-selected sources. More specifi- 
cally, they closely follow those obtained by Magliocchetti et 
al. (2007) for the whole population of F24^m *5 0.35 mjy 
objects from the First Look Survey (A — [9 ± 2] ■ 10~ 4 ), and 
also those obtained by Fang et al. (2004) for IRAC, 8^im- 
selected galaxies in the same field. Lastly, we also note that 
the low clustering signal obtained by Maddox et al. (2010) 
for galaxies selected at 250pm within the Herschel- ATLAS 
consortium is compatible with our PEP results. 



3.2 Relation to spatial quantities 

The standard way of relating the angular two-point correla- 
tion function w(8) to the spatial two-point correlation func- 



tion £(r, z) is by means of the relativistic Limber equation 
(Peebles, 1980): 



w(9) 



JT JT F- 2 {x)x A 9> 2 {x)^r, z)dx du 
[J^° F- 1 (x)x 2 ^>(x)dx] 2 



(2) 



where x is the comoving coordinate, F{x) gives the correc- 
tion for curvature, and the selection function <fc(x) satisfies 
the relation 



N = 



poo 1 pc 

/ $(x)F- 1 (x)x 2 dx = — / 
Jo ils Jo 



N(z)dz, 



(3) 



in which M is the mean surface density on a surface of solid 
angle Q s and N(z) is the number of objects in the given 
survey within the shell (z, z + dz). 

If we adopt a spatial correlation function of the form 
£(r, z) = (r/ro) -1 ' 8 , independent of redshift in the consid- 
ered intervals, and we consider the redshift distributions pre- 
sented in Figure [3] for the adopted cosmology we obtain 
via a ^-square fit to the data: t^ 00 '" 11 = 6.3j^'g Mpc and 
rg 6 ° Mm = 6.7l}'y Mpc (where both quantities are comov- 
ing), respectively for the 100/im and 160/im samples. 

It is worth noting that, while w{6) has been obtained 
over the whole area covered by PACS observations of the 
GOODS South, the adopted redshift distributions only re- 
fer to a sub-sample of ~65% of the PEP sources which are 
located in the (smaller) region where PEP and MUSIC data 
co-exist (cfr. Figure[l}. In principle, such a discrepancy could 
produce biases in the determination of ro. However, as seen 
in §2, the ratio between the number of PEP sources endowed 
with a redshift determination and that of the whole PEP 
sample shows a general independence of far-IR flux. Fur- 
thermore, this trend is the same both in the inner and outer 
regions of the PEP-GOODSS survey (cfr. Figure [2). These 
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findings provide insurance on the reliability of the adopted 
N(z) and therefore on the goodness of the estimated ro. 

As a further consistency check, we have also considered 
the MUSYC catalogue of photometric redshifts derived by 
Cardamone et al. (2010) over the whole Extended Chandra 
Deep Field-South region and assigned via a nearest neigh- 
bourhood procedure a photometric counterpart to our PEP 
sources detected in the blind catalogues. By then adopting 
the redshift distributions originating from this different ap- 
proach, we find for the correlation lengths rJ ° Mm ~ 6.2 Mpc 
and 7^ 6 ° Mm ~ 6.7 Mpc, i.e. the results are virtually equiva- 
lent to those obtained by considering the redshift distribu- 
tions derived from the MUSIC catalogue. 

The above results indicate that the clustering proper- 
ties of far-infrared galaxies closely follow those found for 
24/im-selected sources. Gilli et al. (2007), while analysing a 
sample of SW^on ^ 20/iJy in the same GOODS South field 
we are considering in this work, report a very similar clus- 
tering length (ro ~ 7.3 Mpc) as ours for their sub-class of 
Luminous Infrared Galaxies (LIRG) with L > 1O 11 L0. No- 
ticeably, also the number densities of their 24/im-selected 
sources and of our far-infrared galaxies coincide, leading us 
to conclude that these two classes of objects refer to the 
same galaxy population. It is also worth noticing that Gilli 
et al. (2007) find a dip in the trend of the real-space cor- 
relation function on scales ~ 1 Mpc which, at the median 
redshift (z ~ 1) of their survey, corresponds to those angu- 
lar scales where our data also seem to indicate lower-than- 
expected values for w{6). The above clustering properties for 
far-infrared bright galaxies locally mirror those of 'normal' 
early-type galaxies (e.g. Madgwick et al. 2003; Zehavi et al. 
2005) and it is therefore sensible to envisage an evolution- 
ary connection between these populations whereby the bulk 
of far-IR selected sources at z ~ 1 will eventually evolve in 
passive ellipticals in the nearby universe. 

All the information on the adopted samples, correlation 
function results, number densities etc. can be found summa- 
rized in Table 1. 



4 CONNECTION WITH PHYSICAL 
PROPERTIES: THE HALO BIAS 
APPROACH 

The easiest way to relate the clustering trend of a population 
of extra-galactic sources with their physical properties is by 
means of the Halo Bias approach (Mo & White 1996) which 
describes the observed clustering signal as the product be- 
tween the two-point correlation function of the underlying 
dark matter distribution £d m (r, z) and the square of the so- 
called bias function &(M m i n ,.z), quantity which at a given 
redshift solely depends on the minimum mass M m i n of the 
haloes in which the detected astrophysical objects reside via 
the relation: 

f (r, z) = f dm (r, z) ■ b(M mln , zf. (4) 

Starting from eq.Q, the theoretical angular two-point cor- 
relation function w(0) t h predicted by the halo bias model 
for PACS-selected galaxies at 100/im and 160/jm has then 
been computed once again by following eq.© and the best 
values for M m j n at the two wavelengths have been found 
by a x 2- fit to the observed clustering signal. This gives 



LogM min = 12.4j£j?M Q and LogM min = 12.4t°;*M Q , re- 
spectively for the 100/im and 160/im samples. The corre- 
sponding best-fit curves are shown in Figures 4a and 4b by 
the (blue) dotted lines. 

More sophisticated models have been introduced in the 
recent years in order to provide a more realistic descrip- 
tion of the observed clustering properties of extra-galactic 
sources. One of the most successful ones, the so-called Halo 
Occupation Model (HOM, e.g. Scoccimarro et al. 2001) re- 
lates the clustering properties of a chosen population of 
galaxies with the way such objects populate their dark mat- 
ter haloes, i.e. relaxes the somehow unrealistic assumption, 
implicit in the halo bias model, of having a one-to-one cor- 
respondence between a galaxy and its dark matter halo. In- 
deed, from a quick glance at Figures 4a and 4b, it is possible 
to notice how the more simple halo bias model ([4]) cannot 
reproduce the observed rise in the angular correlation func- 
tion on small angular scales (6 < 0.05 deg), where the signal 
is dominated by pairs of galaxies residing within the same 
dark matter halo. However desirable, such an analysis is un- 
fortunately not possible on our dataset due to relatively poor 
signal-to-noise ratio. In fact, the HOM is a three parameter 
model: together with the minimum mass of the dark matter 
halo it also depends on the the quantities a and iVo accord- 
ing to the expression 

N(M) = N {M/M mln ) a if M > M min , 

where N(M) is the number of galaxies within a halo of cer- 
tain mass M. And a three-parameter fit will only be possible 
when larger fields such as COSMOS and the Lockman Hole 
will be included in the analysis. For the time being, we no- 
tice that the quantities M m i n and a are covariant, so that 
within the HOM framework, higher values for a in general 
correspond to lower values for M mm . This implies that in 
the presence of multiple halo occupancy, the values for M m i n 
found within the HOM scenario will be (slightly) lower than 
those obtained via the halo bias model adopted in this work. 

Despite the above warnings and bearing in mind that 
so far all clustering results at the SPIRE wavelengths have 
only been derived by using theoretical redshift distributions, 
it is however interesting to note that the resulting values for 
Mmi n are in good agreement with those found by Cooray et 
al. (2010) in their 250/im-selected sample obtained by com- 
bining the Lockman-SWIRE and FLS fields, possibly con- 
firming the fact that the imaged sources belong to the same 
underlying population of galaxies as ours. We also note that 
similar results have also been obtained by Magliocchetti et 
al. (2008) for 0.6 < z < 1.2 galaxies with S 2 ^m > AOOfiJy 
taken from the UKIDSS survey, once again indicating that 
the bulk of sources producing the clustering signal at the 
PACS wavelengths coincides with z ~ 1 Spitzer-selected 
galaxies. 



5 LARGE-SCALE STRUCTURE AT Z ~ 2 

Magliocchetti et al. (2007; 2008) found that Spitzer/24/im- 
selected galaxies residing at redshifts z ~ 2 exhibited an ex- 
ceptionally strong clustering signal (ro ~ 16 Mpc). Their re- 
sults were also found to agree with the estimates of Farrah et 
al. (2006a; 2006b) for ultra-luminous infrared galaxies over 
the 1.5 < z < 3 range. The above figure, obtained for galax- 
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Figure 5. Angular correlation function of PACS-selected, z = [1.7 — 2.6], sources with fluxes above the 80% completeness level in the 
GOODS-S at 100/xm (left-hand panel) and 160/im (right-hand panel). The dashed lines indicate the best power-law fits to the data 
obtained for ro = 19.0 Mpc and rp = 17.4 Mpc, respectively at 100/tm and 160/xm, while the dotted lines indicate the best-fit to the data 
according to equation Q with M m ; n = 1O 13 8 M0 (left-hand panel) and M m ; n = 1O 13 7 M0 (right-hand panel). The long-short dashed 
lines in both panels represent the best fits to the data presented in Figure 4. 
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Figure 6. Distribution of z = [1.7 — 2.6] sources in the GOODS-S with fluxes above 80% completeness limit (100/im on the left-hand 
side, 160/^m on the right-hand side). The small dots represent the whole PEP population brighter than the same flux cuts. 



ies undergoing an intense phase of dust-enshrouded star for- 
mation (in the redshift interval 1.7 < z < 2.6, the strong 
spectral PAH feature centred at 7.7pm - indicative of a 
very intense star- forming activity - falls in the 24/im band) , 
was also in agreement with earlier results obtained for star- 
forming galaxies selected at z ~ 2 in the (rest-frame) UV- 
band in both Hubble Deep Fields (Magliocchetti & Maddox 



1999; Arnouts et al. 2002). Although with larger uncertain- 
ties, sub-millimeter-selected galaxies - mainly found in the 
considered redshift range - also seem to cluster with a sim- 
ilar strength (Weiss et al. 2009). More recently, Hartley et 
al. (2010) use the UKIDSS Ultra Deep survey to investigate 
the clustering properties of passive and star-forming galax- 
ies and find that, while locally these two classes of objects 
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Table 1. Properties of sources in the blind GOODS-S at and 160/im: a) total number above 80% completeness level, in brackets 

those used for the clustering analyses; b) amplitude A of the angular two-point correlation function w(8); c) comoving correlation length 
ro; d) minimum dark halo mass M m i n ; number density 



Number sources A ro [Mpc] LogM m ; n [Mq] number density [Mpc 3 ] 



lOO^tm; all z 
lOO/xm; 1.7 < 2 < 2.6 


550 (519) 
33 (33) 


(1.2 ±0.4) x 10" 
(2.7 ±0.7) x 10" 


-3 
-2 


6 3+ 1 ' 1 

"•°-1.3 
19-012.9 


12.4«; 2 5 
13.8«; 2 3 


(2.0 ±0.1) x 10" 
(7.4 ± 1.3) x 10" 


-3 

-5 


160/xm; all z 
160Atm; 1.7 < Z < 2.6 


502 (470) 
32 (32) 


(1.2 ±0.5) x 10~ 
(2.0 ±0.6) x 10~ 


-3 
-2 


6.7+J-? 
17 4+2-8 


12 4 +0 - 4 
-,0 7 +0.3 


(1.7 ±0.1) x 10" 
(7.2 ± 1.3) x 10" 


■3 

-5 



exhibit quite a different behaviour, with the former one be- 
ing about twice as more clustered than the second one, at 
redshifts ~ 2 star-forming and passively evolving galaxies 
have similar clustering strengths, with values for the cor- 
relation length in excellent agreement with those reported 
above (ro ~ 14 Mpc). We note that at lower redshifts, such 
a strong clustering signal finds a counterpart only in the 
case of radio sources (see e.g. Magliocchetti et al. 2004) and 
early type, L > 4 L* galaxies (see e.g. Norberg et al. 2002), 
and is only second to those of rich clusters of galaxies (e.g. 
Guzzo et al. 2000). It is therefore natural to envisage a con- 
nection between these latter objects and the high-z sources, 
whereby massive galaxies undergoing intense star-formation 
at redshifts 2 > 2 end up as the very bright central galaxies 
(passive objects with a high probability for enhanced radio 
activity, see e.g. Best et al. 2007; Magliocchetti & Briiggen 
2007) of local clusters. 

Redshifts around 2 therefore seem very interesting not 
only for what concerns the study of the assembling of the 
stellar content of galaxies (it is in fact at those redshifts 
that the cosmological star formation rate peaks - Madau et 
al. 1996, see also Gruppioni et al. 2010 for a more updated 
version of the Madau plot), but also with respect to the 
formation and evolution of extremely large structures such 
as groups and clusters of galaxies. It is for this reason that 
we have decided to isolate in the 100/xm and 160pm PEP 
catalogues sources in the redshift range 1.7 ^ z ^ 2.6 - so 
to mimic the 24/im selection of galaxies with intense star 
forming activity and therefore strong PAH lines - and have 
then studied their Large-Scale properties. The number of 
sources in the PEP/GOODS South catalogues within the 
above redshift interval and with fluxes above the requested 
cuts which ensure > 80% completeness is 33 in the 100/im 
case and one less at 160/wn. The angular correlation function 
of these objects has then been calculated as in §3.1. 

The corresponding results with associated (Poissonian) 
uncertainties are plotted in Figure O where once again 
those referring to the 100/im band are found on the left- 
hand panel. The short-long dashed lines indicate the best- 
fit to the angular correlation functions obtained for the 
whole PEP samples, both at 100/xm and at 160/mi (Fig- 
ure 4 and Table 1). Despite the large uncertainties, com- 
parisons with these latter results clearly show that z ~ 2 
PEP galaxies are extremely strongly clustered, with an am- 
plitude of about a factor 20 higher than what found for 
PEP sources at all redshifts. A more quantitative analysis 
performed by assuming a power-law form for w = AO^- 1 '\ 
with 7 = 1.8 (see §3), returns the values for the amplitude 
A ioo M m = j 2 7 ± g 7 j . 1Q -2 for the iQO^m-selected sample, 



and A 160 " 1 " = [2.0±0.6] ■ 10~ 2 for the 160/xm-selected one. In 
terms of correlation lengths (cfr. eq. [2} , this corresponds to 

r 100 M m = 19 + 2.6 Mp( , an(J r Ui0„m = 17.4+2.8 Mpc rpj^ 

values (which we stress are obtained by adopting a top-hat 
distribution for N(z) which does not include the (small, cfr 
§2) errors associated to the photometric redshift estimates) 
are extremely high, possibly even higher than those obtained 
in the works summarized at the beginning of this section in 
the case of star- forming galaxies at redshifts z ~ 2. We also 
note that according to the halo bias approach (equation [4}, 
such very high clustering strengths imply z ~ 2 PEP sources 
to be hosted in dark matter haloes of masses respectively 
greater than 13.8lo:3-^© and 13.71o;4Mq at 100 nm and 
160/im, i.e. these sources reside in super-massive/cluster- 
like structures which are already found in place at z ~ 2. 
It is also interesting to stress that the above values agree 
with the early estimates of Daddi et al. (2001), ro ~ 12 — 24 
Mpc, obtained for a handful of spectroscopically determined 
z ~ 2 GOODS-S galaxies. 

A visual investigation of the distribution of PEP- 
selected galaxies in the z — [1.7 — 2.6] interval (cfr. Figure 
[6| shows that this is indeed the case. Out of 33 sources with 
Sioomhi ^ 2 mjy, 18 of them, corresponding to ^55% of the 
total, reside in a huge, filamentary structure, about 4 Mpc 
across in projection, centred at z ~ 2.2. A similar behaviour, 
although less pronounced (which explains the slightly lower 
value obtained for the correlation length in this latter case) 
is also visible at 160/im. We note that a forming, diffuse, 
super-structure at z ~ 2.2 was already identified by Salim- 
beni et al. (2009), while studying the properties of overden- 
sities in the GOODS South. It is however remarkable the 
way this overdensity shows up in such a clean way and with 
such a high significance in a numerically much smaller sam- 
ple which is uniquely made of dusty star-forming galaxies. 

We know however that the GOODS South seems to be a 
very peculiar field, already dominated by a couple of clusters 
around redshifts 0.7 and 1 (e.g. Gilli et al. 2003; Adami et al. 
2005; Vanzella et al. 2005; Daz-Sanchez et al. 2007). So, it is 
sensible to state the statistical significance of our findings. 
Straight integration of the Sheth & Tormen (1999) dark halo 
mass function in the considered z — [1.7 — 2.6] redshift range 
indicates that the number of haloes with masses greater than 
10 13 ' 5 Mq, compatible with our findings, on an area like that 
subtended by the GOODS-South is about 14. This corre- 
sponds to the 15, 100/im-selected, real 'field' (i.e. not belong- 
ing to the filamentary super-structure) galaxies observed 
with Herschel-PACS with Sioo/im ^ 2 mjy. From the above 
figures we can then draw a first, interesting, conclusion: ev- 
ery dark matter halo which reaches the mass necessary to 
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host a (dusty) star-forming galaxy at redshift z ~ 2 will be 
inhabited by (at least) one of such objects. In other words, the 
above statement indicates that star-forming galaxies at red- 
shifts around 2 are common events, their observed paucity 
(number densities of the order of a few 10~ 5 Mpc, cfr. Table 
1) simply due to the paucity of high mass dark matter haloes 
at the considered redshifts. In a similar fashion, we can also 
conclude that, for what concerns the spatial distribution of 
z ~ 2 dust-obscured galaxies, GOODS South is not a par- 
ticularly peculiar field as it comprises (within Poissonian 
uncertainties) a number of sources which is compatible with 
theoretical expectations. 

Similar back-of-the-envelope calculations further show 
that the number density associated to the projected distri- 
bution of Sioojtm *5 2 mjy proto-cluster sources is about 
2 x 10~ 4 Mpc, with a <$ P rotociu S tcr/<5ficid — 5, i.e. as already 
expected, the overdensity at z ~ 2.2 , is not yet virialized 
(<5 v ir ~ 178 for a ACDM universe; Peebles 1980) but will 
likely evolve in a bound structure. 



6 THE Z ~ 2.2 OVERDENSITY 

The z ~ 2.2 filamentary structure singled out in the high 
redshift clustering analyses has an undoubtable relevance 
per se as it provides one of the first clear examples of forma- 
tion of super-structures made of star-forming galaxies right 
at the peak of cosmic star-forming activity (for a further 
example cfr. Andreani, Magliocchetti & de Zotti 2010). It is 
therefore worth concentrating some more on this structure 
by analyzing its members and their properties as compared 
to those of similar galaxies residing in less dense environ- 
ments. 



In order to do that, we have released the request for 
>80% completeness and considered all PEP/100/im-selected 
sources within the 1.7 ^ z ^ 2.6 redshift interval which in 
projection appear to belong to this diffuse overdensity. The 
number of objects fulfilling the above requirements is 28 
and they are indicated in Figure [JJ with (red) empty cir- 
cles. Thirteen of these galaxies have a spectroscopic red- 
shift determination from the literature and are highlighted 
in Figure [7] with different symbols. More in detail, three of 
them, distributed along the whole length of the filamentary 
structure, have z ~ 2.4 (filled red dots), six of them have 
z ~ 2.1 — 2.2 and are all clustered in the left-hand part of the 
structure (filled blue squares), while respectively two+two 
sources have redshifts around 2.0 and 1.7, the former pair 
residing in the outermost right-hand part of the overden- 
sity (green triangles), while the other, probably not really 
belonging to the diffuse super-structure, is spread along the 
proto-cluster (magenta stars). It therefore seems that we are 
in the presence of a very diffused/possibly forming super- 
structure at z ~ 2.4 'bracketed' at the two ends of its length 
by two more concentrated clumps set at a median redshift 
z ~ 2.1 (with a scatter Az ~ 0.1 - 0.2). 

In order to investigate the influence of environment 
on star-forming activity at redshifts ~ 2, we have then 
estimated masses and bolometric luminosities for the 25 
1-9 2 J; 2.6 sources residing within the area identified by 
the super-structure and then compared these latter quanti- 
ties with those obtained for more isolated or 'field' galaxies 
(28 objects), otherwise selected from the 100/im-PEP sam- 
ple with the same criteria as the sources belonging to the 
overdensity. 

Stellar masses have been derived according to the 
method described in Rodighiero et al. (2007; 2010) which 
fits the UV-to-5.8/im broad band photometric data with the 
Bruzual & Chariot (2003) stellar population synthesis mod- 
els, assuming a Salpeter (Salpeter 1955) Initial Mass Func- 
tion (IMF) and a Calzetti extinction law (Calzetti et al. 
1994), once the redshift of each source is fixed to its spec- 
troscopic or, whenever unavailable, photometric value. For 
GOODS-S the stellar masses are also available from San- 
tini et al. (2009) and are calculated with the same synthesis 
models and very similar recipes as in Rodighiero et al. (2007; 
2010). 

Bolometric luminosities Lboi have instead been com- 
puted by using two different methods. The first one, pre- 
sented in Rodighiero et al. (2010), fits the optical-to-FIR 
SEDs with a library of local templates, including those 
of Polletta et al. (2007), and a few additional modified 
templates with colder FIR emission (see Gruppioni et al. 
2010 for further details) at a given redshift. Lboi was then 
obtained by integrating the best-fit SEDs over the whole 
[8 — 1000] ^tm rest-frame range. The second method instead 
follows that presented in Santini et al. (2009). Luminosities 
in this case have been derived by fitting the 100/im observa- 
tions to the Dale & Helou (2002) synthetic templates. First, 
a total infrared luminosity in the [8 — 1000] ^im range was 
assigned to each template by using the empirical relation of 
Marcillac et al. (2006) between Lboi and the predicted ratio 
between fluxes at 60/xm and 100/xm. Once the k-correction 
was applied, we then selected the model which best repro- 
duces the observed 100/im luminosity and normalized it by 
using the flux difference between the model and the observed 
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PEP/100/mi-selected galaxies in the redshift range 1.9 < z < 2.6. The solid line represents galaxies belonging to the proto-clustcr 
structure, the dotted line the others ('field' galaxies). 



galaxy. These two methods produce virtually identical re- 
sults, so in the following we only present the outcomes of 
the Santini et al. (2009) method. 

Figure [8] presents the distribution of 100/im fluxes (left- 
hand panel), masses (middle panel) and luminosities (right- 
hand panel) for the galaxies belonging to the filamentary 
2 ~ 2.2 overdensity (solid line) as compared to those of more 
isolated galaxies residing within the same redshift range 
(dotted line). As it is possible to appreciate, while the dis- 
tribution of masses in the two cases follows a similar trend, 
'field' galaxies seem to be on average fainter than sources 
belonging to the overdensity, both in terms of 100/im flux 
and bolometric luminosity. If we then convert bolometric lu- 
minosities into estimates for the star formation rate (SFR) 
via the relation SFR[M /yr] = 1.8 x 10~ 10 L bol /L© (Ken- 
nicutt et al. 1998) , it follows that Figure [8] also provides 
a snapshot for the star formation activity within and out- 
side the enhanced-density area. By doing this, and despite 
the relative small statistics, we find that the sources within 
the filamentary structure have an enhanced star formation 
activity (peak at ~ 3OOM0/yr) with respect to more iso- 
lated galaxies, which exhibit a peak in their SFR at about 
180A/Q/yr. We are therefore in the presence of boosted star- 
formation activity in the over-dense region, probably due to 
the joint effect of large reservoirs of gas available at high red- 
shifts in deep potential wells such as those associated to large 
overdensities and the enhanced rate of encounters between 
sources favoured by their proximity. Larger datasets are ob- 
viously needed to provide the ultimate answer on this issue. 
This result is in agreement with the findings of Popesso et al. 
(2010) who observe an inversion of the relationship between 
environmental density and star formation rate at around 
redshift 1 with respect to the local results, where SFR is 
depressed in overdense regions. 



7 ON THE REDSHIFT EVOLUTION OF THE 
LINK BETWEEN LUMINOUS AND DARK 
MATTER 

Since GOODS-South has been observed at a great wealth 
of frequencies, important information such as mass in 
stars, bolometric luminosity and star formation rate for the 
sources which populate the area can be obtained. A com- 
bination of these properties with the results inferred from 
the clustering analysis on the mass of their dark matter 
haloes can then provide interesting constraints on the re- 
lation between - say - luminous and dark matter in star 
forming galaxies and its eventual dependence on look-back 
time. Given the extreme similarity in the clustering results 
obtained in this work between the 100/im and 160/im chan- 
nels (cfr. Table 1), and the better determination of the rele- 
vant parameters derived in the former case due to its higher 
angular resolution, in the following we will only concentrate 
on galaxies selected at 100/im. 

Stellar masses and star formation rates were then de- 
rived for all sources detected by PEP at 100/im in the 
GOODS-South field endowed with a reliable (either pho- 
tometric or spectroscopic) redshift determination. This was 
done as described in §6. Furthermore, in order to ensure 
both completeness of the sample and consistency with the 
clustering results, we have restricted our analysis to galax- 
ies with 100/im fluxes brighter than 2 mJy. Finally, in order 
to investigate possible redshift dependencies of the consid- 
ered quantities, we have considered two different samples: 
a first one which comprises galaxies in the 0.7 ^ z ^ 1.2 
range (iV g ai = 82 with both stellar mass and luminosity 
estimates), and a second one which focuses on sources at 
higher, 1.7 ^ z ^ 2.6, redshift (-/V ga i = 32 with both stellar 
mass and luminosity estimates). 

Future works will address in more detail the properties 
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of the sources detected by the PEP survey in GOODS-S. 
We here discuss average stellar masses and star formation 
rates of these two samples in conjunction with the minimum 
dark matter halo mass M m i n as determined from clustering 
results. 

The mean redshifts of the two considered samples are < 
z >= 0.94 and < 2 >= 2.19. Average stellar masses, bolo- 
metric luminosities and star formation rates are found to 
be < M» >= 1O 1O ' 6±O ' 4 M , < L bo i >= 1O 11 ' 4±O ' 3 L and 

< SFR >= 52 ± 35 M /yr for the intermediate-redshift 
sample and < M* >= 1O 11±O ' 3 M , < L hol >= 1O 12 ' 3±O ' 2 L 
and < SFR >= 444 ± 210 M /yr for the high-z one. 

Despite the uncertainties due to the relatively small 
number of sources considered in the analysis, the above fig- 
ures suggest that by moving towards the present universe 
not only - as expected for a flux-limited survey - we are se- 
lecting less luminous and actively star-forming galaxies, but 
also that on average such galaxies have smaller stellar masses 
than their high-redshift counterparts. As we saw in §§4 and 
5, this is also true when we consider the masses of the dark 
matter halos which host such objects. However, the increase 
of mass with redshift in this latter case seems to be more 
pronounced. Indeed, while for stellar masses we find a ratio 

< M,(< z >= 0.94) > / < M*(< 2 >= 2.19) >~ 0.4, 
for the minimum mass of dark matter haloes we obtain 
M min (< z >~ 1)/M min (< z >= 2.19) ~ 0.04 (cfr. Table 
1), i.e. a value which is about ten times smaller. 

We note that, although for the calculation of M m i n (< z >~ 
1) we have technically used the whole sample of 100/im- 
selected galaxies brighter than 2 mjy, the presence of the two 
very marked peaks in their redshift distribution at z ~ 0.7 
and z ~ 1.1 (cfr. Figure 3) effectively ensures that the ob- 
served clustering signal is dominated by sources which re- 
side at redshifts which are close to those of the two peaks 
and therefore that the measured values for M m j n closely re- 
semble those that would be obtained for a redshift-limited, 
0-7 < z < 1.2, sample such as that considered in this section. 

A number of recent works have focussed on both theo- 
retical (e.g. Moster et al. 2010) and observational (e.g. Fou- 
caud et al. 2010; Leauthaud et al. 2011; Wake et al. 2011) 
aspects of the relationship between stellar and total mass in 
galaxies as a function of both redshift and galaxy masses. 
The observational works deal with clustering analyses and 
their results can therefore be compared with what we ob- 
tain in the present paper. In more detail, both Foucaud et 
al. (2010) and Wake et al. (2011), despite the different se- 
lection criteria (galaxies selected in the near-IR rather than 
in the far-IR regime as it is our case) , span a redshift and a 
stellar mass range which roughly overlap with ours. Foucaud 
et al. (2010) find for galaxies with stellar masses respec- 
tively within the values 1O 1O ' 5 M < M» < 1O 11O M and 
IO^Mq sC M» < 1O 12 M at < z >= 1 and < z >= 1.73 
values for the number density n g and clustering length ro, 
n g (< z >= 1) = (9 ± 1) • 10" 4 Mpc" 3 , r (< z >= 1) = 
8.0 ± 1.4 Mpc and n g (< z >= 1.7) = (6 ± 2) ■ 10~ 5 
Mpc -3 , ro(< z >— 1.7) = 22 ± 9 Mpc, in very good agree- 
ment with our findings. The same can be said of the Wake 
et al. (2011) results who find in their < z >= 1.1 and 

< z >— 1.9 redshift bins respectively n 9 (< z >= 1.1) ~ 
2 • 10~ 3 Mpc~ 3 , r (< 2 >= 1.1) = 8.0 ± 0.5 Mpc and 
n g (< z >= 1.9) ~ 8.5-10 -5 Mpc" 3 , r (< z >= 1.9) = 16±2 



Mpc for stellar masses M*/M (< z >= 1) > 7 • 10 9 and 
M,/M Q (< 2 >= 1.9) > 10". 

The concordance between the above results and ours 
also implies that our findings on the relationship between lu- 
minous and dark matter, i.e. the ratio M*/M m i n at redshifts 
~ 1 and ~ 2 agree with what reported by both Foucaud et 
al. (2010) and Wake et al. (2011). We also note that, even 
though direct comparisons are more difficult given the way 
data are presented, at z ~ 1 our findings are also in agree- 
ment with the results from Leauthaud et al. (2011) who add 
galaxy lensing to their analysis, and with those of Meneux 
et al. (2008). 

Although the intrinsic limitations of dealing with a flux- 
limited catalogue mask possible strong conclusions that can 
be drawn on the evolution of the luminous-to-dark mat- 
ter ratio in galaxies with redshift /galaxy mass, however the 
pieces of information that we have gathered in this section, 
when combined with the above results taken from the litera- 
ture, suggest that at higher redshifts PEP-selected galaxies 
were more actively forming stars than their lower-z counter- 
parts. At the same time, they also contained a lower frac- 
tion of their mass in (evolved) stars. These findings then 
converge at indicating an evolutionary picture whereby the 
bulk of stellar formation within massive galaxies took place 
at 2 > 2 (in excellent agreement with the results on the cos- 
mic evolution of the star formation rate - e.g. Madau et al. 
1996). The intensity of this dramatic process then decreases 
as we approach the local universe, and already by z ~ 1 star 
formation activity within massive galaxies is largely reduced 
as most of the stars within such galaxies have already been 
formed. 



8 CONCLUSIONS 

We have presented the first direct estimates of the 3D corre- 
lation properties of far-infrared-selected sources up to red- 
shifts 2 ~ 3. This has been possible thanks to the Pacs Evo- 
lutionary Probe (PEP) survey of the GOODS South field 
performed with the PACS instrument onboard the Herschel 
Satellite. 550 and 502 sources were detected respectively 
in the 100/xm and in the 160pm channels down to fluxes 
Sioo^m = 2 mjy and Sieo/jm = 5 mjy, cuts which ensure 
>80% completeness of the two catalogues. 

These two samples have then been cross-correlated with 
the MUSIC (Santini et al. 2009) catalogue in order to pro- 
vide counterparts at other wavelenghts to the PEP sources 
and, where possible, to also assign them a (either photo- 
metric or where available spectroscopic) redshift. This was 
possible in more than 65% of the cases, independent of in- 
frared flux. This apparent incompleteness in the success rate 
for redshift determination is merely due to the fact that the 
PEP survey covers an area which is larger than those probed 
by surveys at smaller (mostly optical and near-IR) surveys. 
In fact, the above percentage rises to ~ 95% in the inner 
portion of the GOODS-S, i.e. virtually all PEP sources are 
endowed with a redshift estimate. 

Sources blindly detected at 100/xm and 160pm above 
the 80% completeness limit have then been used to calcu- 
late the angular two-point correlation function w(9) of far- 
infrared galaxies over the whole redshift range spanned by 
the data. Under the assumption of a power-law behaviour 
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w(9) = A9 1 * , for a fixed value of 7 = 1.8, we find an am- 
plitude A ~ 1.2 • 10 , identical within the uncertainties at 
both frequencies. By using the redshift distributions derived 
from the MUSIC catalogue, we then obtain values for the co- 
moving correlation length ro ~ 6.3 Mpc and ro ~ 6.7 Mpc, 
respectively at 100/im and 160/im. Within the halo bias ap- 
proach (Mo & White 1996), such a clustering strength cor- 
responds to dark matter haloes of masses M > 10 12 ' 4 M Q . 

These values are in excellent agreement with those of 
Gilli ct al. (2007), obtained for a sample of Luminous In- 
frared Galaxies selected at 24/im in the same GOODS South 
field. Also, the number densities of their sources and of 
our far-infrared galaxies coincide, indicating that these two 
classes of objects refer to the same galaxy population. 
It is also worth noticing that our results are compatible with 
those derived at 250/im by Maddox et al. (2010) and Cooray 
et al. (2010) in the case of SPIRE-selected sources. In par- 
ticular, Cooray et al. (2010) find values for the dark halo 
masses which are in excellent agreement with our estimates. 

With the aim of studying in greater detail the large- 
scale structure traced by actively star-forming galaxies in 
the redshift range where most of the cosmic star-formation 
activity is confined, we have then selected sources from the 
100/im and 160/im catalogues in the z = [1.7 — 2.6] inter- 
val. An analysis of the de-projected two-point correlation 
function in this case gives ro ~ 19 Mpc and ro ~ 17.4 Mpc 
at the two frequencies of interest. By using once again the 
halo bias model, the above values correspond to halo masses 
M > 1O 13 ' 7 M . 

These are extremely high values and well match those found 
for similarly star-forming galaxies selected at other wave- 
lcnghts in the same redshift range (e.g. Magliocchetti et al. 
1999; Arnouts et al. 2002; Farrah et al. 2006b; Maglioc- 
chetti et al. 2007; 2008; Brodwin et al. 2008; Weiss et al. 
2009; Foucaud et al. 2010; Hartley et al. 2010; Wake et al. 
2011) and for quasars at earlier epochs (Shen et al. 2007). 
Furthermore, by combining our clustering results with the 
information on stellar masses for galaxies in the GOODS-S, 
we find that the ratio between average values of the stel- 
lar mass < M f > at different epochs increases when go- 
ing from z ~ 1 to z ~ 2 by about a factor 10 less than 
what happens in the case of the dark matter haloes host- 
ing such galaxies (M»/M m i n (< z >~ 1) ~ 4 • 10 _1 vs. 
M*/M min (< z >~ 2) ~ 4 • 10~ 2 ). When compared with 
recent results taken from the literature (e.g. Foucaud et al. 
2010; Wake et al 2011; Moster et al. 2010 to mention a few), 
our findings converge at confirming the evolutionary picture 
of downsizing wherby massive galaxies at z ~ 2 were more 
actively forming stars than their z ~ 1 counterparts and at 
the same time also contained a lower fraction of their mass 
in (evolved) stars. This is also in agreement with the most 
recent results on the evolution of the cosmic star formation 
rate (cfr. e.g. Gruppioni et al. 2010) and indicates that the 
bulk of stellar formation within massive galaxies took place 
at z > 2. The intensity of this dramatic process then de- 
creases as we approach the local universe, and already by 
z ~ 1 star forming activity is largely reduced as most of the 
stars within such galaxies have already been formed. 

A deeper investigation of the PEP maps indicate that 
the strong clustering signal detected for z ~ 2 PEP galaxies 
is due to the presence, more visible at 100/im than in the 
other band, of a wide (at least 4 Mpc across in projection), 



M > 10 14 Mq, filamentary structure which includes more 
than 50% of the sources detected above the 80% complete- 
ness level at z ~ 2. 

Such an overdensity, already detected in Salimbeni et al. 
(2009) but with a lower statistical significance, extends all 
along the GOODS-S and seems to be made of a more diffuse 
structure at z ~ 2.4, 'bracketed' at the two ends of its length 
by two more concentrated clumps set at redshifts z ~ 2.2 
and z ~ 2. The total number of 100/im-selected galaxies 
down to the PEP, Sioo^m ^ 1 mjy flux limit and residing 
in the overdensity is 28. We stress that this sample is not 
expected to be complete, both for geometrical reasons (fil- 
ament possibly extending beyond the edges of the surveyed 
area) and in terms of flux limits (completeness of the 100/im 
band just ~10% at ~1 mjy). 

It is nevertheless interesting to compare some of the 
properties of the galaxies which reside within the overden- 
sity with those of more isolated or 'field' sources, otherwise 
selected with the same criteria. For instance, a comparison 
between the distribution of bolometric luminosities for these 
two classes of objects shows a trend for galaxies belonging to 
the filamentary super-structure to be more luminous than 
field sources. If confirmed by analyses performed on larger 
datasets, this result also implies a higher star formation rate 
in proto-cluster objects as compared with isolated galaxies, 
finding which is likely due to the joint effect of large reser- 
voirs of gas available at high redshifts in deep potential wells 
such as those associated to large overdensities and the en- 
hanced rate of encounters between sources favoured by their 
proximity. 

As a last consideration, we note that extended proto- 
clusters of star-forming galaxies at z ~ 2 seem to be 
relatively common events, especially when found in asso- 
ciation with radio sources (see e.g. Pentericci et al. 2000; 
Steidel et al. 2005; Andreani, Magliocchetti & de Zotti, 
2009; Hatch et al. 2010 and references therein). However, 
a systematic study of these objects is currently limited 
by the capabilities of the state-of-the-art instruments 
which either sample wide but shallow fields or concentrate 
on deeper/pencil-beam- like surveys. Deeper/wider area 
surveys in the far-IR regime such as those planned for 
the SPICA mission (SPICA Study Team Collaboration, 
2010) will be able to provide a more statistically significant 
census of these highly dense regions so to characterize the 
interplay between formation and evolution of the large-scale 
structure and of the galaxies that trace it right at the epoch 
when most of the cosmic action takes place. 
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